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[57] ABSTRACT 

A hybrid compiler-interpreter comprising a compiler 



for "compiling" source program code, and an inter- 
preter for interpreting the "compiled" code, is provided 
to a computer system. The compiler comprises a code 
generator that generates code in intermediate form with 
data references made on a symbolic basis. The inter- 
preter comprises a main interpretation routine, and two 
data reference handling routines, a dynamic field refer- 
ence routine for handling symbolic references, and a 
static field reference routine for handling numeric refer- 
ences. The dynamic field reference routine, when in- 
voked, resolves a symbolic reference and rewrites the 
symbolic reference into a numeric reference. After re- 
writing, the dynamic field reference routine returns to 
the main interpretation routine without advancing pro- 
gram execution to the next instruction, thereby allow- 
ing the rewritten instruction with numeric reference to 
be reexecuted. The static field reference routine, when 
invoked, obtain data for the program from a data object 
based on the numeric reference. After obtaining data, 
the static field reference routine advances program 
execution to the next instruction before returning to the 
interpretation routine. The main interpretation routine 
selectively invokes the two data reference handling 
routines depending on whether the data reference in an 
instruction is a symbolic or a numeric reference. 

10 Claims, 5 Drawing Sheets 
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guage interpreter that translates source code into inter- 
METHOD AND APPARATUS FOR RESOLVING mediate form code and references data in the manner 
DATA REFERENCES IN GENERATED CODE described above is the BASIC interpreter. 

The "interpreted" approach avoids the problems 
BACKGROUND OF THE INVENTION 5 encountered with the "compiled" approach, when a 

1. Field of the Invention program is constructed in pieces. However, because of 
The present invention relates to the field of computer the extra level of interpretation at execution time, each 

systems, in particular, programming language compilers time an instruction comprising a symbolic reference is 
and interpreters of these computer systems. More spe- interpreted, execution is slowed significantly, 
cifically, the present invention relates to resolving refer- 10 Thus, it is desirable if programming languages can be 
ences in compiler generated object code. implemented in a manner that provides the execution 

2. Background performance of the "compiled" approach, and at the 
The implementation of modern programming lan- same ^ flexibility 0 f the ''interpreted" approach 

guages, including object oriented programming lan- for data Qbj ^ ^ 

guages, are generally grouped into two categories: com- ^ prograins to be compiled. As will be disclosed, 

P In rcomp^Vogramming language, a computer * e P res f mv ff on P rovides * ; method .^apparatus 

program (called a compiler) compiles the source pro- resolymg data references m compiler generated 

gram and generates executable code for a specific com- code that achieves the desired results, 

puter architecture. References to data in the generated 20 SUMMARY OF THE INVENTION 
code are resolved prior to execution based on the layout 

of the data objects that the program deals with, thereby, A method and apparatus for generating executable 

allowing the executable code to reference data by their code and resolving data references in the generated 

locations. For example, consider a program that deals code is disclosed. The method and apparatus provides 

with a point data object containing two variables x and 25 execution performance substantially similar to the tradi- 

y, representing the x and y coordinates of a point, and tional compiled approach, as well as the flexibility of 

further assume that the variables x and y are assigned altering data objects like the traditional interpreted 

slots 1 and 2 respectively, in each instance of the point approach. The method and apparatus has particular 

data object. Thus, an instruction that accesses or fetches application to implementing object oriented program- 

y, such as the Load instruction 14 illustrated in FIG. 1, 30 m f n g languages in computer systems. 

is resolved to reference the variable y by the assigned Under the present invention, a hybrid compiler-inter- 

slot 2 before the instruction sequence is executed. Par- f compsking a compiler for "compiling" source 

ticular examples of programming language compilers CQd ^ ^ mte eter for mterpretmg th e 

that generate code and resolve data references in the JI * «, _ , • ^-Jl a ^ „ ° ^ 

manner described above include C and C+ + compil- 35 « >m P ,led , "** ■ P"""** t0 a 

erg r The compiler comprises a code generator that generates 

This "compiled" approach presents problems when a code m intermediate form with data references made on 
program is constructed in pieces, which happens fre- a symbolic basis. The interpreter comprises a main in- 
quently under object oriented programming. For exam- terpretation routine, and two data reference handling 
pie, a program may be constructed from a library and a 40 routines, a static field reference routine for handling 
main program. If a change is made to the library, such numeric references and a dynamic field reference rou- 
that the layout of one of the data objects it implements tine for handling symbolic references. The dynamic 
is changed, then clients of that library, like the main field reference routine, when invoked, resolves a sym- 
program, need to be recompiled. Continuing the pre- bolic reference and rewrites the symbolic reference into 
ceding example, if the point data object had a new field 45 a numeric reference. After rewriting, the dynamic field 
added at the beginning called name, which contains the reference routine returns to the interpreter without 
name of the point, then the variables x and y could be advancing program execution to the next instruction, 
reassigned to slots 2 and 3. Existing programs compiled thereby allowing the rewritten instruction with numeric 
assuming that the variables x and y and are in slots 1 and reference to be reexecuted. The static field reference 
2 will have to be recompiled for them to execute cor- 50 routine , w hen invoked, obtain data for the program 
rectly. f rom a object based on the numeric reference. 

In an interpreted language, a computer program After obtaioing ^ ^ static fie i d reference routine 
(called a translator) translates the source statements of a ^vances program execution to the next instruction 
program into some mtermeo^te fonn, typically inde- before retiirning to the interpreter. The main interpreta- 
pendent of any computer instruction set References to 55 selectively invokes the two data reference 

data in the intermediate form are not fully resolved " " * T u * \7 ZT , . - 

before execution based on the layout of the data objects handlm S routmes pending on whether the data refer- 
that the program deals with. Instead, references to data ****** instruction is a symbolic or a numeric refer- 
are made on a symbolic basis. Thus, an instruction that t , . J . 

accesses or fetches y, such as the Load instruction 14' 60 Asa result > • compiled intermediate form object 
illustrated in FIG. 1, references the variable y by the of a program achieves execution performance 

symbolic name "y"- The program in intermediate form substantially similar to that of the traditional compiled 
is executed by another program (called an interpreter) object code, and yet it has the flexibility of not having to 
which scans through the code in intermediate form, and he recompiled when the data objects it deals with are 
performs the indicated actions. Each of the symbolic 65 altered like that of the traditional translated code, since 
references is resolved during execution each time the data reference resolution is performed at the first execu- 
instruction comprising the symbolic reference is inter- tion of a generated instruction comprising a data refer- 
preted. A particular example of a programming lan- ence. 



05/06/2004, EAST version: 1.4.1 



PREFERRED AND ALTERNATE 
EMBODIMENTS 



5,367,685 

3 4 

BRIEF DESCRIPTION OF THE DRAWINGS ^^^T^ a T 

om^±^mt **u- x^xvn. u^^o systems. The constitutions and basic functions of these 

The objects, features, and advantages of the present elements are well known and wDl not be farther de- 
invention will be apparent from the following detailed scribed here. 

description of the presently preferred and alternate 5 As shown in FIG. 3 V the software elements of the 

embodiments of the invention with references to the exemplary computer system of FIG. 2 comprises an 

drawings in which: operating system 36, a hybrid compiler-interpreter 38 

FIG. 1 shows the prior art compiled approach and incorporated with the teachings of the present inven- 

the prior art interpreted approach to resolving data tion, and applications compiled and interpreted using 

reference. 10 the hybrid compiler-interpreter 38. The operating sys- 

FIG. 2 illustrates an exemplary computer system tern 36 and the applications 40 are intended to represent 

incorporated with the teachings of the present inven- a broad categories of these software elements found in 

tion. many computer systems. The constitutions and basic 

FIG. 3 illustrates the software elements of the exem- functions of these elements are also well known and will 

plary computer system of FIG. 2. IS not be described further. The hybrid compiler-inter- 

FIG. 4 illustrates one embodiment of the compiler of preter 38 will be described in further detail below with 

the hybrid compiler-interpreter of the present inven- references to the remaining figures, 

tion. Referring now to FIGS. 4 and 5, two block diagrams 

FIG. 5 illustrates one embodiment of the code gener- illustrating the compiler of the hybrid compiler-inter- 

ator of the compiler of FIG. 4. 20 preter of the present invention are shown. Shown in 

FIG. 6 illustrates one embodiment of the interpreter FIG. 4 is one embodiment of the compiler 42 compris- 

and operator implementations of the hybrid compiler- ing a lexical analyzer and parser 44, an intermediate 

interpreter of the present invention. representation builder 46, a semantic analyzer 48, and a 

FIG. 7 illustrates the cooperative operation flows of code generator 50. These elements are sequentially 
the main interpretation routine, the static field reference 25 coupled to each other. Together, they transform pro- 
routine and the dynamic field reference routine of the gram source code 52 into tokenized statements 54, inter- 
present invention. mediate representations 56, annotated intermediate rep- 

FIG. 8 illustrates an exemplary resolution and rewrit- resentations 58, and ultimately intermediate form code 

ing of a data reference under the present invention. 60 with data references made on a symbolic basis. The 

DETAILED DESCKIETIOX PRESENTLY » J—-5SS EttZSZSSSZ 

intended to represent a broad category of these ele- 
ments found in most compilers. The constitutions and 
A method and apparatus for generating executable basic functions of these elements are well known and 
code and resolving data references in the generated 35 will not be otherwise described further here. Similarly, 
code is disclosed. The method and apparatus provides a variety of well known tokens, intermediate represen- 
execution performance substantially similar to the tradi- tations, annotations, and intermediate forms may also be 
tional compiled approach, as well as the flexibility of used to practice the present invention, 
altering data objects like the traditional interpreted As shown in FIG. 5, the code generator 50 comprises 
approach. The method and apparatus has particular 40 a main code generation routine 62, a number of compli- 
application to implementing object oriented program- mentary operator specific code generation routines for 
ming languages. In the following description for pur- handling the various operators, such as the ADD and 
poses of explanation, specific numbers, materials and the IF code generation routines, 64 and 66, and a data 
configurations are set forth in order to provide a thor- reference handling routine 68. Except for the fact that 
ough understanding of the present invention. However, 45 generated coded 60 are in intermediate form and the 
it will be apparent to one skilled in the art that the pres- data references in the generated code are made on a 
ent invention may be practiced without the specific symbolic basis, the main code generation routine 62, the 
details. In other instances, well known systems are operator specific code generation routines, e.g. 64 and 
shown in diagrammatical or block diagram form in 66, and the data reference handling routine 68, are in- 
order not to obscure the present invention unnecessar- 50 tended to represent a broad category of these elements 
ily. found in most compilers. The constitutions and basic 
Referring now to FIGS. 2 and 3, two block diagrams functions of these elements are well known and will not 
illustrating an exemplary computer system incorporated be otherwise described further here, 
with the teachings of the present invention are shown. For further descriptions on various parsers, interme- 
As shown in FIG. 2, the exemplary computer system 20 55 diate representation builders, semantic analyzers, and 
comprises a central processing unit (CPU) 22, a mem- code generators, see A. V. Aho, R. Sethi, and J. D. 
ory 24, and an I/O module 26. Additionally, the exem- UUman, Compilers Principles, Techniques and Tools, Ad- 
plary computer system 20 also comprises a number of dison-Wesley, 1986, pp. 25-388, and 463-512. 
input/output devices 30 and a number of storage de- Referring now to FIGS. 6 and 7, two block diagrams 
vices 28. The CPU 22 is coupled to the memory 24 and 60 illustrating one embodiment of the interpreter of the 
the I/O module 26. The input/output devices 30, and hybrid compiler-interpreter of the present invention 
the storage devices 28 are also coupled to the I/O mod- and its operation flow for handling data references is 
ule 26. The I/O module 26 in turn is coupled to a net- shown. As shown in FIG. 6, the interpreter 70 corn- 
work 32. prises a main interpretation routine 72, a number of 
Except for the manner they are used to practice the 65 complimentary operator specific interpretation rou- 
present invention, the CPU 22, the memory 24, the I/O tines, such as the ADD and the IF interpretation rou- 
module 26, the input/output devices 30, and the storage tines, 74 and 76, and two data reference interpretation 
devices 28, are intended to represent a broad category routines, a static field reference routine (SFR) and a 
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dynamic field reference routine (DFR), 78 and 80. The 
main interpreter routine 72 receives the byte codes 82 of 
the intermediate form object code as inputs, and inter- 
prets them, invoking the operator specific interpretation 
routines, e.g. 74 and 76, and the data reference routines, 5 
78 and 80, as necessary. Except for the dynamic field 
reference routine 80, and the manner in which the main 
interpretation routine 72 and the state field reference 
routine 78 cooperates with the dynamic field reference 
routine 80 to handle data references, the main interpre- 10 
tation routine 72, the operator specific interpretation 
routines, e.g. 74 and 76, and the static field reference 
routine 78, are intended to represent a broad category of 
these elements found in most compilers and interpreters. 
The constitutions and basic functions of these elements 15 
are well known and will not be otherwise described 
further here. 

As shown in FIG. 7, upon receiving a data reference 
byte code, block 86, the main interpretation routine 
determines if the data reference is static, i.e. numeric, or 20 
dynamic i.e. symbolic, block 88. If the data reference is 
a symbolic reference, branch 886, the main interpreta- 
tion routine invokes the dynamic field reference rou- 
tine, block 90. Upon invocation* the dynamic field refer- 
ence routine resolves the symbolic reference, and re- 25 
writes the symbolic reference in the intermediate form 
object code as a numeric reference, block 92. Upon 
rewriting the data reference in the object code, the 
dynamic field reference routine returns to the main 
interpretation routine, block 100, without advancing the 30 
program counter. As a result, the instruction with the 
rewritten numeric data reference gets reexecuted again. 

On the other hand, if the data reference is determined 
to be a numeric reference, branch 88a, the main inter- 
pretation routine invokes the static field reference rou- 35 
tine, block 94. Upon invocation, the static field refer- 
ence routine obtain the data referenced by the numeric 
reference, block 96. Upon obtaining the data, the static 
field reference routine advances the program counter, 
block 98, and returns to the main interpretation routine, 40 
block 100. 

Referring now to FIG. 8, a block diagram illustrating 
the alteration and rewriting of data references under the 
present invention in further detail is shown. As illus- 
trated, a data referencing instruction, such as the 45 
LOAD instruction 14", is initially generated with a 
symbolic reference, e.g. "y". Upon its first interpreta- 
tion in execution, the data referencing instruction, e.g. 
14, is dynamically resolved and rewritten with a nu- 
meric reference, e.g. slot 2. Thus, except for the first 50 
execution, the extra level of interpretation to resolve the 
symbolic reference is no longer necessary. Therefore, 
under the present invention, the "compiled" intermedi- 
ate form object code of a program achieves execution 
performance substantially similar to that of the tradi- 55 
tional compiled object code, and yet it has the flexibility 
of not having to be recompiled when the data objects it 
deals with are altered like that of the traditional trans- 
lated code, since data reference resolution is performed 
at the first execution of a generated instruction compris- 60 
ing a symbolic reference. 

While the present invention has been described in 
terms of presently preferred and alternate embodiments, 
those skilled in the art will recognize that the invention 
is not limited to the embodiments described. The 65 
method and apparatus of the present invention can be 
practiced with modification and alteration within the 
spirit and scope of the appended claims. The description 



is thus to be regarded as illustrative instead of limiting 
on the present invention. 
What is claimed is: 

1. In a computer system comprising a program in 
source code form, a method for generating executable 
code for said program and resolving data references in 
said generated code, said method comprising the steps 
of: 

a) generating executable code in intermediate form 
for said program in source code form with data 
references being made in said generated code on a 
symbolic basis, said generated code comprising a 
plurality of instructions of said computer system; 

b) interpreting said instructions, one at a time, in 
accordance to a program execution control; 

c) resolving said symbolic references to correspond- 
ing numeric references, replacing said symbolic 
references with their corresponding numeric refer- 
ences, and continuing interpretation without ad- 
vancing program execution, as said symbolic refer- 
ences are encountered while said instructions are 
being interpreted; and 

d) obtaining data in accordance to said numeric refer- 
ences, and continuing interpretation after advanc- 
ing program execution, as said numeric references 
are encountered while said instruction are being 
interpreted; 

said steps b) through d) being performed iteratively 
and interleavingly. 

2. The method as set forth in claim 1, wherein, said 
program in source code form is implemented in source 
code form of an object oriented programming language. 

3. The method as set forth in claim 2, wherein, said 
programming language is C. 

4. The method as set forth in claim 2, wherein, said 
programming language is C+ + . 

5. The method as set forth in claim 1, wherein, 
said program execution control is a program counter; 
said continuing interpretation in step c) is achieved by 

performing said step b) after said step c) without 
incrementing said program counter; and 
said continuing interpretation in said step d) is 
achieved by performing said step b) after said d) 
after incrementing said program counter. 

6. In a computer system comprising a program in 
source code form, an apparatus for generating execut- 
able code for said program and resolving data refer- 
ences in said generated code, said apparatus comprising: 

a) compilation means for receiving said program in 
source code form and generating executable code 
in intermediate form for said program in source 
code form with data references being made in said 
generated code on a symbolic basis, said generated 
code comprising a plurality of instructions of said 
computer system; 

b) interpretation means for receiving said generated 
code and interpreting said instructions, one at a 
time; 

c) dynamic reference handling means coupled to said 
interpretation means for resolving said symbolic 
references to corresponding numeric references, 
replacing said symbolic references with their cor- 
responding numeric references, and continuing 
interpretation by said interpretation means without 
advancing program execution, as said symbolic 
references are encountered while said instructions 
are being interpreted by said interpretation means; 
and 
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d) static reference handling means coupled to said 
interpretation means for obtaining data in accor- 
dance to said numeric references, and continuing 
interpretation by said interpretation means after 
advancing program execution, as said numeric 
references are encountered while said instruction 
are being interpreted by said interpretation means; 

said interpretation means, said dynamic reference 
handling means, and said static reference handling 



10 



8 



means performing their corresponding functions 
iteratively and interleavingly. 

7. The apparatus as set forth in claim 6, wherein, said 
program in source code form is implemented in source 
code form of an object oriented programming language. 

8. The apparatus as set forth in claim 7, wherein, said 
programming language is C 

9. The apparatus as set forth in claim 7, wherein, said 
programming language is C-h + . 

10. The apparatus as set forth in claim 6, wherein, 
said program execution control is a program counter. 
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